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Abstract 



The usual way to investigate the statistical properties of finitely gen- 
erated subgroups of free groups, and of finite presentations of groups, is 
based on the so-called word-based distribution: subgroups are generated 
(finite presentations are determined) by randomly chosen /c-tuples of re- 
duced words, whose maximal length is allowed to tend to infinity. In this 
paper we adopt a different, though equally natural point of view: we inves- 
tigate the statistical properties of the same objects, but with respect to the 
so-called graph-based distribution, recently introduced by Bassino, Nicaud 
and Weil. Here, subgroups (and finite presentations) are determined by 
randomly chosen Stallings graphs whose number of vertices tends to infin- 
ity 

Our results show that these two distributions behave quite differently 
from each other, shedding a new light on which properties of finitely gener- 
ated subgroups can be considered frequent or rare. For example, we show 
that malnormal subgroups of a free group are negligible in the graph- 
based distribution, while they are exponentially generic in the word-based 
distribution. Quite surprisingly, a random finite presentation generically 
presents the trivial group in this new distribution, while in the classical 
one it is known to generically present an infinite hyperbolic group. 

Keywords: subgroups of free groups, finite group presentations, statistical 
properties, Stallings graphs, partial injections, malnormality 

MSG: 20E05, 05A15, 20F69 

1 Introduction 

Statistical properties of elements and subgroups of free groups have evoked 
much interest in recent years, especially after Gromov's famous claim ^14^ 0.2. A] 
that "most" groups were hyperbolic, which led to precise statements and proofs 
by Ol'shanskii [28] and by Champetier [HI [9]. Shortly thereafter, Ol'shanskii 
and Arzhantseva [2| fT| pursued the study of the statistical properties of finite 
presentations of groups, that is, largely, of finitely generated normal subgroups 
of free groups. We refer the reader to the survey by Ollivier [27] for more details. 

This interest encountered another historical trend in combinatorial group 
theory, namely the consideration of algorithmic problems, which leads naturally 
to an interest in the evaluation of the complexity of these algorithms (e.g. 
[5| 121^ [29]) and in enumeration problems. 

The search for innovative group-based cryptographic systems (see [25] for 
instance) only reinforced the study of complexity questions, and focused it on 
the investigation of the statistical properties of finitely generated subgroups of 
free groups, notably via the notion of generic complexity (see [18] [TS]). 

The usual method to approach statistical properties is to enumerate the 
objects under consideration, or more precisely, representatives for these objects, 
in a stratified way. For instance, if we wish to investigate fc-generated subgroups 
of Fr (resp. finitely presented groups with r generators and k relators), we 
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proceed by enumerating lists of fc-tuples of generators (resp. relators) over a 
fixed alphabet of r letters, so that at level n one has enumerated all such k- 
tuples whose elements have length at most n. In the situation we will consider, 
there are only finitely many objects of a given level n and it makes sense to 
ask what proportion of level n objects satisfy a given property. This gives us 
a number pn between and 1 for each n, associated to the given property and 
one can ask whether this sequence has a well defined limit. If the limit exists 
and is equal to 1, we would say that the property is generic, and take this to 
mean that most objects satisfy the property. At the other extreme, if the limit 
of the Pn equals 0, we would say that the property in question is negligible and 
conclude that it is rarely encountered amongst our objects. 

A crucial observation, which is well worth mentioning in view of the intuitive 
weight carried by expressions such as most objects or rarely encountered, is that 
genericity and negligibility depend essentially on the choice of the stratification: 
different stratifications of the same objects, say finitely generated subgroups of 
free groups, will bring to light different insights on the statistical behavior 
of these objects. Concretely, different properties will appear to be generic or 
negligible. 

Up to recently (namely the publication of [3j ) , the literature was unanimous 
in adopting the representation of finitely generated subgroups of free groups by 
fe-tuples of generators, stratified by their maximal length - which we call the 
word-based distribution. 

It is the purpose of this paper to question this unanimity. The basic idea is 
that there exists another very natural representation of finitely generated sub- 
groups of free groups, by their Stallings graph ([32], see Section [2T]l . Stratifying 
finitely generated subgroups by the size (number of vertices) of their Stallings 
graph - what we call the graph-based distribution - indeed sheds a different 
light on which properties of subgroups are frequent or rare. One of our main 
results is that malnormality and purity, which are generic in the word-based 
distribution, are negligible in the graph-based distribution (Section d]). 

We also exhibit a property of finitely generated subgroups of Fr that is neg- 
ligible in the word-based distribution and that has a non-zero, non-one asymp- 
totic probability (namely e~^) in the graph-based distribution (Section [5]). 

Finally we explore the possibility of using the graph-based distribution to 
discuss the statistical properties of finitely presented groups. The results there 
are disappointing: it turns out that finitely presented groups are generically 
trivial in this distribution - quite differently from the word-based distribution 
in which they are known to be generically infinite and hyperbolic (Section [6]). 

Sections [2] and [3] are devoted to preliminaries on genericity and to a review 
of the main features of the word-based and the graph-based distributions for 
finitely generated subgroups of free groups. 

2 Preliminaries 

Here we summarize standard facts about the Stallings graphs of subgroups (in 
Section [2. ip and we review the notions of generic and negligible properties. 
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Throughout the paper, A denotes an alphabet, that is, a finite non-empty set 
and F{A) denotes the free group over A. The elements of F(A) are represented 
by the reduced words written using letters from A and their formal inverses 
{a~^ I a £ A}. If r > 1, we often use the notation Fr instead of F{A), to 
indicate that A consists of r letters. Throughout the paper, we will in fact 
assume that r > 2. 

We denote by [n] (n > 1) the set {!,..., n}. 

2.1 Subgroup graph representation 

Each finitely generated subgroup of F[A) can be represented uniquely by a 
finite graph of a particular type, by means of the technique known as Stallings 
foldings [32]. This representation has been used by many authors, frequently 
using combinatorial, graph-theoretic notations that slightly differ from those 
used by Stallings. It is this formalism that we also use, which can also be found 
in [Hll [T71 ESI [22]. The procedure of Stallings foldings is informally described 
at the end of this section. 

An A-graph is defined to be a pair T = {V, E) with E ^ V x A x V , such 
that 

• if (u, o, w), (m, a, v') S E, then v = v'; 

• if (u, a, v), (u' ,a, v) G E, then u = u' . 

The elements of V are called the vertices of F and the elements of E are its edges. 
We say that F is connected if the underlying undirected graph is connected. If 
V G y, we say that f is a leaf if v occurs at most once in (the list of triples 
defining) E and we say that F is v-trim if no vertex w ^ v is a leaf. Finally 
we say that the pair (F,f) is admissible if F is a finite, ii-trim and connected 
^-graph. Then it is known (see [32l [Ml [13 [22] ) that: 

• Stallings associated with each finitely generated subgroup H of F{A) a 
unique admissible pair of the form (F, 1), which we call the graphical 
representation or the Stallings graph of H and write T(H); 

• every admissible pair (F, 1) is the graphical representation of a unique 
finitely generated subgroup of F(A); 

• if (F, 1) is the graphical representation of H and « is a reduced word, then 
u £ H if and only if u labels a loop at 1 in F (by convention, an edge 
{u, a, v) can be read from u to v with label a, or from v to u with label 

• if (F, 1) is the graphical representation of H, then rank(ff) = \E\ — |y| 

• finitely generated subgroups H and K are conjugates if and only if the 
cyclic cores of T{H) and T{K) (obtained by repeatedly deleting leaves 
and the edges they are adjacent to) are equal. 
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We informally remind the readers of the computation of the graphical rep- 
resentation of a subgroup generated by a subset B = {ui, . . . ,Uk}- It consists 
in building an {A U 74~^)-graph, changing it into a ^-graph, then reducing it 
using foldings. First build a vertex 1. Then, for every word u of length n in 
B, build a loop with label u from 1 to 1, adding n — 1 vertices. Change every 
edge (u,a^^,v) labeled by a letter of A^^ into an edge {v,a,u). Then itera- 
tively identify the vertices v and w whenever there exists a vertex u and a letter 
a E A such that either both {u, a, v) and {u, a, w) or both (f , a, u) and {w, a, u) 
are edges in the graph (the corresponding two edges are folded, in Stallings' 
terminology) . 

The resulting graph V is such that (F, 1) is admissible and, very much like 
in the (1-dimensional) reduction of words, it does not depend on the order used 
to perform the foldings. 



2.2 Negligibility and genericity 

Let S" be a countable set, the disjoint union of finite sets 5„ (n > 0), and let 
Bn = Ui<n'^»- Typically in this paper, S will be the set of Stallings graphs, 
of partial injections, of reduced words or of fc-tuples of reduced words, and Sn 
will be the set of elements of S of size n. A subset X of 5 is negligible (resp. 
generic) if the probability for an element of Bn to be in X, tends to (resp. to 
1) when n tends to infinity; that is, if lim„ ^"'j^'^"^ = (resp. = 1). 

Naturally, the negligibility or the genericity of a subset X oi S depends on 
the layering of S into the Sn- In particular, if X and its complement are both 
infinite, then an appropriate partition of S into finite subsets 5„ will make X 
negligible, another will make it generic, and indeed, another will be such that 
lim„ I g"^" ^ ~ P ^o'^ fixed Q < p <1. 

Thus, any discussion of negligibility or genericity must clearly specify the 
distribution that is considered, that is, the choice of the partition (<S'„)„. 



2.2.1 Rate of convergence 

In general, we may be interested in the speed of convergence of - towards 

if X is negligible and towards 1 if it is generic. One reason is that a higher 
speed of convergence indicates a higher rate of confidence that a randomly 
chosen element of S of size n will miss X if X is negligible, or will be in X if 
X is generic, even for moderately large values of n. 

If a class J- of functions tending to is closed under max (of two elements) , 
we say that a subset X is T -negligible if ^^^"^"^ = 0{f{n)) for some f ^ T. 
We also say that X is T -generic if the complement of X is J^-negligible. Note 
that J^-negligible (resp. J^-generic) sets are closed under finite unions and 
intersections. 

Much of the literature is concerned with exponential negligibility or gener- 
icity, namely ^"-negligibility or genericity where T is the class of functions e""^" 
(OO). 
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2.2.2 Balls versus spheres 

The definition of negligibility and genericity above is given in terms of the balls 
Bn- the sets of elements of size at most n. It is sometimes more expedient to 
reason in terms of the proportion of elements of X in the spheres Sn- let us say, 
within the ambit of this section, that a set X is S-negligible (resp. S-generic) 
if the ratio ' | g | tends to (resp. 1). The definition of J^-S-negligibility or 
J^-S-genericity is analogous. We verify in this section that (exponential) S- 
negligibility implies (exponential) negligibility. The same holds of course for 
genericity. 

Proposition 2.1 An S-negligible (resp. S-generic) set is also negligible (resp. 
generic). 

If the structures under consideration grow fast enough, so that lim = 0, 
then the same result holds for exponential negligibility and genericity. 

The proof of this statement relies on the following technical lemma. 

Lemma 2.2 Let (an) and (6„) be increasing sequences of positive real numbers. 

(1) (Stolz-Cesaro theorem) If lim6„ = oo and lim ^""""^I^" = 0, then 
lim ^ = 0. ""^^ 

(2) If (^) and ( ^""""^"^^ ) converge to exponentially fast and if an < bn for 
each n, then (|^) converges to exponentially fast as well. 

Proof. (1) Since lim = 0, for each e > 0, there exists no such that 

a„+i — a„ < e{bn+i — bn) for all re > uq. Summing these inequalities for all 
integers between n — 1 and ng, we find that a„ — a„Q < e{bn — bn^ ) for all n > uq. 
Dividing by 6„ and using ttiG fact tb.a-t lim bn = oo, we conclude that ^ < 2e 
for all large enough n. 

(2) Our hypothesis is now that there exists c > such that a„+i — a„ < 
e~^"'{bn+i — bn) for all re > reo. Summing these inequalities for the integers 
between re and 2re — 1, we find that 02n — On < e~'^^{b2n — bn) for all re > reo. 
We now divide both sides by bon and use the fact that #^ < and that this 
sequence converges to exponentially fast to conclude that (f^)n converges 
to exponentially fast. Summing instead for the integers between re and 2re 
and dividing by 62n+i shows that (f^^^)n converges to exponentially fast as 
weh. " □ 

Proof of Proposition 12. ll Let X (1 S, On = \Xr\ Bn\ and 5„ = \Bn\. Then 
On — CLn-i = l-'^ H S'nj and bn — bn-1 = \Sn\. The statement on (exponential) 
negligibility now follows directly from Lemma [2 .21 The statement on genericity 
follows as well, since generic sets are the complements of negligible sets. □ 
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3 The word-based and the graph-based distributions 



In order to discuss the distribution of finitely generated subgroups of Fr, we 
need to fix a representation of these subgroups by means of discrete structures. 
In this paper we consider two such structures: a subgroup can be given by a 
tuple of generators (reduced words in Fj.), or by its Stallings graph (Section l2.ip . 
In the first case, the size of the representation is the pair (fc, n) where k is the 
number of generators and n their maximal length - or n if A; is fixed; in the 
second case, the size of the representation is the number n of vertices of the 
Stallings graph. In either case, there are only finitely many subgroups of each 
size. 

We first review the literature on the word-based and the graph-based dis- 
tributions (Sections 13.11 and 13. 2p . and then start the discussion of negligible or 
generic properties of subgroups (Section 13. 3p . 

3.1 The word-based distribution 

The distribution usually found in the literature (e.g. [18l [151 US]) is in fact a 
distribution on the /c-tuples /i = (/ii, . . . , /ifc) of reduced words of length at most 
n, where k is fixed and n is allowed to grow to infinity; one then considers the 
subgroup H generated by h. We call this distribution word-based. 

Let us first record three elementary facts, which can also be found in [ 15^1 . 
We denote by Rn the set of reduced words of length at most n. 

Fact 3.1 = ^—(2r - 1)" f 1 — - — — V □ 

Proof. The number of reduced words of length z > 1 is 2r(2r — 1)*^^, so the 
cardinality of Rn is 

, , . ^ ,• 1 (2r - 1)" - 1 
\Rn\ = l + Yl 2K2r - 1)'"' = 1 + 2r ^ ^ ^ 

i=l 

= l + ^((2r-ir-l) 
r — 1 

-(2r-l)" (l 



1^ ^ V r(2r-l)" 



□ 



Fact 3.2 Let < a < 1. Exponentially generically, a reduced word in Rn has 
length greater than an. □ 

Proof. The proportion of words in of length less than or equal to an, is 

l^lanll -^(2r- l)L"'^J(l + o(l)) , , 

\Rn\ - ^(2r-l)«(l + o(l)) ^^^"^^^^ 

< (2r-l)("-i)"(l + o(l)). 



^We choose to reiterate the proofs of these results, because we feel that our presentation 
exhibits more clearly their combinatorial underpinnings. 
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Since a — 1 < 0, it converges to exponentially fast. 



□ 



Let h = {hi, . . . , hk) be a tuple of reduced words and let n > such that 
min|/ij| > 2[/i]. We denote by Pref^{h) the set of prefixes of length at most 
In] of the hi and h~^ . 

Fact 3.3 Let < A < ^. Exponentially generically, a fc-tuple (/ii, . . . , /i^) of 
elements of Rn, is such that min > 2 [An] and the prefixes of the hi and h~^ 
of length [An] are pairwise distinct. □ 

Proof. The complement in of the set of fc-tuples described in the statement 
is the union of the set Y of fe-tuples h where min \ hi\ < 2 [An] , and of the set Z 
of /c-tuples where min \ hi\ > 2 [An] and the set of prefixes of length [An] of the 
hi and h~ has at most 2k — 1 elements. Since 2A < 1, the set Y is exponentially 
negligible by Fact 13.21 and we now concentrate on Z. 

For each integer 2 [An] < m < n, let Z^a be the set of /c-tuples in Z, such 
that min \ hi\ = m. Then 

\Zm.\< (2r(2r - i)rAnl-i)2fc-i f^^2k - 1) (2r - l)H-rri-2\Xn]) 
< {2rf''-^ k{2k - 1) (2r - l)k{m-2)-\Xn]+l _ 

Summing these inequalities for all 2 [An] < m < n, we find 

\Z\ < (2r)2'=^i k{2k - 1) (2r - i)Kn-i)-\Xn]+i _ 

As a result, the proportion of A;-tuples in Z is at most 



(2r)2fc-i k{2k - 1) (2r - i)Kn-i)-\\n] + 



-(2r-l)*^"(l + o(l)) 



< C7 (2r- l)-^"(l + o(l)) 



for some constant C depending only on k and r. Thus, this proportion converges 
to exponentially fast. □ 



Remark 3.4 The small cancellation property, a closely related statement rel- 
ative to common factors located anywhere in the words hi and h^^ (not just at 
their extremities), is discussed in Lemma 14.51 in a variant of Arzhantseva and 
Ol'shanskii's result on cyclic words [21 Lemma 3]. □ 



Let < a < 1 and < A < ^, and for each n, let Ya,A,n,fc be the set of 
fe-tuples h = (/ii, . . . , /ifc) G R^ such that min > an and the prefixes of the 
hi and h^^ of length [An] are pairwise distinct. Facts [3?2] and [331 show that 
the proportion of elements of R^ in Ya^\^n,k converges to 1 exponentially fast: 
in the search for exponentially generic properties of subgroups, we can restrict 
our attention to the tuples in Ya^x^n,k and to the subgroups they generate. 

The following observation is the basis for our exponential genericity proofs, 
in the context of the word-based distribution. 
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Fact 3.5 Let a, A satisfy < 2A < a < 1. If h e YaXn,k and H = (h), then 
T{H) consists of two parts: 

- the vertices at distance at most [An] from the distinguished vertex and the 
edges connecting them: this forms a tree with vertex set PrefAn(^)) and edges 
u — ua\i a ^ A and n, ua G PrefAn(/i); this tree, which we call the central part 
of T{H), has 2k leaves; 

- and for each 1 < i < A;, where hi = pirriiSi and \pi\ = \si\ = [An], a path 
labeled mj from the vertex pi to the vertex (both are in the central part); 
we call these paths the outer loops. □ 

This leads to the following results. Propositions 13.6 1 and 13.81 first appeared 
in a paper by Jitsukawa ^15j. They are direct consequences of earlier analo- 
gous results (counting cyclic words instead of words) due to Arzhantseva and 
Ol'shanskii [21 Lemma 3]. 

Proposition 3.6 Exponentially generically, a k-tuple of elements of Rn gen- 
erates a subgroup of rank k. 

Proof. Let a, A satisfy < 2A < a < 1. As observed above, it suffices to show 
that if /i G ^,A,n,fcj then H = (h) has rank k. In that case, using Fact 13. 5^ we 
find that T(H) is formed of a central part and k outer loops. 

The central part is a tree and like all trees, the number of its edges is 1 
less than the number of its vertices. With the notation of Fact 13.51 the number 
of additional vertices (resp. edges) in the outer loops is X]j(|m,i| — 1) (resp. 
Y^- \mi\). Therefore, in T{H), we have \E\ — \V\ + 1 = k and hence rank{H) = k 
(see Section [2T]) . □ 

Proposition 3.7 Exponentially generically, a k-tuple h and an k' -tuple h' of 
elements of Rn generate subgroups that are distinct, have trivial intersection, 
and are such that {h,h') = (h) * {h'). 

Proof. Since the first k components of a (/c + A;')-tuple of elements of Rn 
are independent from the k' last components, and since such a (fc + /c')-tuple 
exponentially generically generates a subgroup of rank k + k' (Proposition 13. 6p , 
we find that a fc-tuple and an fc'-tuple of elements of Rn exponentially generically 
generate their free product. This in turn implies the other properties. □ 

Proposition 13.71 shows that two A:-tuples of elements of Rn exponentially 
generically generate distinct subgroups. Proposition 13.81 is a little more precise. 

Proposition 3.8 Let a, A satisfy < 2A < a < 1. The k-tuples h and h' 
in Ya^\^n,k generate distinct subgroups, unless h' = (^^^(i); • • • ; ^^^(fc)) for some 
permutation a of [k] anrf /or ei, . . . , G { — 1, +1}. 

Proof. If {h) = {h'), then the graphs T{h) and T{h') are equal. In particular, 
their central parts, formed by the vertices at distance at most [An] from the 
distinguished vertex, coincide. Fact 13.51 shows that if /i G i^a,A,ra,fc, then T{h) 
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completely determines all the A;-tuples h' G Ya^\^ri,k such that T{h') = r(/i), and 
that they coincide with h up to the order of the elements and the direction in 
which the outer loops are read. □ 

Proposition 13.81 shows that, if we consider the class Sa,x,n.k of subgroups 
generated by A;-tuples in Ya^\^n,k^ then each subgroup occurs the same number 
of times, namely 2^k\. Randomly choosing a /c-tuple in Ya x^n,k yields therefore 
a random subgroup in Sa,\,n,k, and the proportion of these subgroups among 
all subgroups generated by a fc-tuple of words of length at most n tends to 1 
exponentially fast. 

3.2 The graph-based distribution 

The uniform distribution on the set of size n Stallings graphs was analyzed 
by Bassino, Nicaud and Weil f3]. Here we summarize the principles of this 
distribution and the features which will be used in this paper. 

We already noted that in Stallings graphs, each letter labels a partial injec- 
tion on the vertex set: in fact, a Stallings graph can be viewed as a collection 
{fa)a£A of partial injections on an n-element set, with a distinguished vertex, 
and such that the resulting graph (with an a-labeled edge from i to j if and 
only if j = fa{i)) is connected and has no vertex of degree 1, except perhaps the 
distinguished vertex. We may even assume that the n-element set in question 
is [n] = {!,..., n}, with 1 as the distinguished vertex, see [3l Section 1.2] for 
a precise justification. In particular, the automorphism group of an admissible 
pair (r, 1) is always trivial. 

One shows [3, Corollary 2.7] that the probability that an A-tuple (/a)aeA of 
partial injections on [n] induces a Stallings graph tends to 1 as n tends to infinity, 
and the problem of randomly generating a Stallings graph then reduces (via an 
efficient rejection algorithm, see O Section 3]) to the problem of efficiently 
generating a random partial injection on [n\. This view of a Stallings graph as 
an 74-tuple of partial injections on [n] is central in our analysis. 

The maximal orbits of a partial injection / (equivalently: the connected 
components of the function graph of /) can be of two kinds: cycles - where 
each element is both in the domain and in the range of / - and sequences. The 
size of each of these components is defined to be the number of vertices which 
they contain. It is this combinatorial view of partial injections ~ as a disjoint 
union of cycles and sequences -, which is at the heart of the random generation 
algorithm, obtained using the so-called recursive method I12j. 

The distribution of sizes of components is studied in [3| Section 3] , as well as 
the distribution of cycles vs. sequences among size k components. The random 
generation algorithm consists in drawing a size of component, say A;, according 
to the relevant distribution; then drawing whether this size k component is a 
cycle or a sequence; and finally drawing a partial injection on the remaining 
n — k elements [H Section 3.1]. This results in a partial injection on an n- 
element set, and we need only add a random numbering (1 through n) of the 
elements of that set. 
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However complex the method may seem, it guarantees a uniform distribu- 
tion among all size n partial injections, it is easy to implement and its average 
time complexity is linear (in the RAM model; it is 0{n'^ log n) under the bit-cost 
assumption) [3l Section 3.3]. 

To further discuss partial injections and other combinatorial structures, we 
use the notion of exponential generating series, written EGS. If 5 is a class of 
finite discrete structures such that there are finitely many 5-structures of each 
size, let Sn be the number of 5-structures of size n. The EGS of S is the formal 
power series S{z) = X^„>o 

Let I{z) = Yln>o 1^^"' be the EGS of partial injections. Bassino, Nicaud 
and Weil show the following [3l Section 2.1 and Proposition 2.10]. 

Proposition 3.9 The EGS I{z) of partial injections satisfies the following 

This result is obtained by means of deep theorems from analytic combina- 
torics. The same methods can be used to study the asymptotic behavior of 
particular parameters, such as the number of sequences of a partial injection. 
This parameter is directly connected with the number of edges in the Stallings 
graph formed by the partial injections fa (a € A), which leads to the following 
result [3l Lemma 2.11 and Corollary 4.1]. 

Proposition 3.10 The expected number of sequences in a randomly chosen 
partial injection of size n is asymptotically equivalent to \/n. 

The expected rank of a randomly chosen size n subgroup of E,. is asymptot- 
ically equivalent to (r — l)n — r^/n + 1. 

3.3 Negligible and generic properties of subgroups 

Thus, in the discussion of statistical properties of finitely generated subgroups 
of a (fixed) free group Er, we have two distributions at our disposal. One, 
the word-based distribution, is governed by two parameters: the number of 
generators and their maximum length, the former fixed and the latter allowed 
to tend to infinity. The other, the graph-based distribution, is governed by a 
single parameter: the size of the Stallings graph. 

We first observe that our discussion of the graph-based distribution (as well 
as the results in [3J) is in terms of spheres rather than balls: as we saw in 
the Section 12.2.21 the (exponential) negligibility or genericity results obtained 
in that setting are sufficient. In contrast, the existing literature on the word- 
based distribution is in terms of balls, as is our description in Section [3TT] above. 

The graph-based as well as the word-based distribution allow the discussion 
of properties of subgroups (of subgroups of a fixed rank k in the word-based 
case). There is of course no reason why a property that is generic or negligible 
in one distribution should have the same frequency in the other. 

Our two distributions are indeed very different. How different is illustrated 
in Figure [H which shows a "random" size 200 Stallings graph and the Stallings 
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Figure 1: The Stallings graphs of two randomly generated subgroups of F2. On the 
left, a subgroup generated by a random 5-tuple of words of length at most 40. On the 
right, a random Stallings graph of size 200. Only the shape of the graphs is depicted, 
vertices and edge labels and directions are not represented. The pictures have been 
generated by neato. Note that the scale (average distance between two vertices) is not 
the same on the two pictures. 



graph of the subgroup of F2 generated by a "random" 5-tuple of words of length 
at most 40 (which has close to 200 vertices). This figure provides the intuition 
to exhibit properties of subgroups that are negligible in one distribution and 
generic in the other. 

It is not difficult to come up with such properties. It is the case, for instance, 
of the property to have rank i, for a fixed integer i > 1. In the graph-based dis- 
tribution, this property is negligible as a consequence of Proposition 13.101 (see 
[21 Corollary 4.2]). In contrast, it is exponentially generic in the word-based 
distribution of i-generated subgroups, see Proposition 13.61 For the same rea- 
son, it is exponentially negligible in the word-based distribution of k-generated 
subgroups with k ^ £. 

The properties of malnormality and purity, discussed in Section HI provide 
more complex examples of this sort. 

4 Malnormal and pure subgroups 

Malnormality and purity are two important properties of subgroups. A sub- 
group H is pure if G and n 7^ implies x £ H. A pure subgroup is also 
called closed under radicals or isolated. 

The subgroup H is malnormal if H f] = 1 for every g ^ H. Malnormal 
subgroups play an important role in the study of amalgamated products (e.g. 
[191 [6]) and in the characterization of their hyperbolicity pOj. The following is 
elementary from the definition. 

Lemma 4.1 If a subgroup is malnormal, then it is pure. 

Note that the converse statement does not hold: {a,bab^^) is pure, yet not 
malnormal. 

Both malnormality and purity have nice graphical characterizations, which 
imply that these properties are decidable for finitely generated subgroups of 
free groups. The result on malnormality is due to Kapovich and Myasnikov 
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[T7] (following a decidability result in [3]), that on purity is due to Birget, 
Margolis, Meakin and Weil [5]. 

Proposition 4.2 Let (F, 1) he the graphical representation of a subgroup H . 

(1) H is non-malnormal if and only if there exists a non-trival reduced word 
u and distinct vertices x ^ y inT such that u labels loops at x and at y. 

(2) H is non-pure if and only if there exists a non-trival reduced word u, an 
integer n > 2 and a vertex x in T such that u" labels a loop at x hut u 
does not. 

4.1 Genericity in the word-based distribution. . . 

Jitsukawa shows that malnormality is a generic property in free groups [151 
Theorem 4 and Lemma 6]. His arguments can be extended to show that it is 
exponentially generic. 

Theorem 4.3 Malnormality is exponentially generic in the word-based distri- 
bution. 

In view of Lemma 14.1^ we also have the following result. 

CoroUary 4.4 Purity is exponentially generic in the word-based distribution. 

We now proceed to prove Theorem l4.31 The proof relies on the two following 
lemmas, which provide an analogue of a small cancellation property for tuples 
of reduced words. These lemmas constitute a variant of the results of Gromov 
[I4l 0.2. A] for tuples of cyclic words, proved also by Champetier [U E] and by 
Arzhantseva and Ol'shanskii [21 Lemma 3]. 

Lemma 4.5 Let < /3 < 1. The proportion of k -tuples h of reduced words in 
Fj. of length at most n, such that one of the hi contains two distinct occurrences 
of factors v and w of length at least fin, with v = w or v = , converges to 
exponentially fast. 

Lemma 4.6 Let < /? < 1. The proportion of k-tuples h of reduced words of 
length at most n, such that a word v of length at least j3n has an occurrence in 
one of the hi and v or has an occurrence in hj for some j ^ i, converges 
to exponentially fast. 

We can now prove Theorem 14.31 

Proof of Theorem 14.31 Let < a < 1 and < A < f . Let h = {hi, . . . , hk). 
Exponentially generically, we have min | /ij | > an and the prefixes of length [An] 
of the hi and are pairwise distinct (Facts [3?2] and (3^ . In addition, expo- 
nentially generically, no word of length at least '^~^^ n has distinct occurrences 
as a factor of the hi and the h~^ (Lemmas 14.51 and 14. 6p . 
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Let us now assume that h satisfies all these properties. Then T = T{{h)) is 
composed of a central part, which is a tree containing the distinguished vertex 
and all the vertices corresponding to the prefixes of the /ij and h^^ of length 
up to [An], and of outer loops whose labels are factors of the hi (or the h^^, 
depending on the direction in which they are read), see Fact 13.51 

Any loop in F must visit the central part of F at least once, and run along 
at least one of the outer loops. Let us now assume that a word u labels two 
distinct loops in F. Up to conjugation of u, we can assume that the base point 
of the first loop is in the central part of F. Then u has a factor v of length 
\an] — 2 [An], which is a factor of some hi or h~^ . The other occurrence of 
a loop labeled u reveals another path in F labeled v. This path may not be 
entirely in an outer loop, but if it is not, then it visits the central part of F only 
once, so it has a factor v' of length at least "'~^^ n in an outer loop, and hence 
in one of the /i,- or hj^ . This word v' has distinct occurrences as a factor of the 
hi and the h~^ , a contradiction. □ 

4.2 . . . and negligibility in the graph-based distribution 

In contrast, we show that malnormality and purity are negligible in the graph- 
based distribution. 

Theorem 4.7 The probability that a random subgroup of size n is pure is 
0{n-i). 

By Lemma l4.ll this implies the following 

Corollary 4.8 The probability that a random subgroup of size n is malnormal 
is 0{n^2y 

To prove Theorem 14. 7^ we observe that if is a finitely generated subgroup 
of Fr and a cycle of length at least 2 in T(H) is labeled by a power of some 
letter a, then H is not pure (Proposition 14. 2p . Therefore, if a subgroup is pure, 
then the partial injection determined by each letter in A has only sequences 
and length 1 cycles. 

Thus Theorem 14.71 follows directly from the following proposition. 

Proposition 4.9 The probability that a size n partial injection has no cycle of 
length greater than or equal to 2 is asymptotically equivalent to 

Our proof of Proposition 14.91 uses Hayman's theorem, discussed in Sec- 
tion 14.2.11 below. 

Remark 4.10 There are many more reasons for a subgroup to fail to be pure, 
than those considered here. In terms of Proposition |3]2l we have considered only 
the words u that are equal to a letter of the alphabet. As a result, the probability 
of purity and that of malnormality are likely to be much smaller than the upper 
bounds given above. The open question here is whether purity and normality 
are exponentially negligible with respect to the graph-based distribution. □ 
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4.2.1 H-admissible functions and Hayman's theorem 

Hayman's theorem on the asymptotic behavior of the coefficients of certain 
power series requires a technical hypothesis called H-admissibility. Here we give 
only the technical definition and statement we will use, and we refer the readers 
to im Chapter VIII] for further details on this theorem and on saddlepoint 
asymptotics in general. 

Let f{z) be a function of the form f{z) = e^^^^ that is analytic at the origin, 
with radius of convergence p. We denote by [z'^]f{z) the coefficient of in the 
power series development of / at the origin. Let 

a(r) = rh'{r) and b{r) = r'^h (r) + rh'{r). 

The function f{z) is said to be H-admissible if there exists a function 6: ]0, p[ — > 
]0, 7r[ such that the following three conditions hold: 

(HI) limr^pb{r) = +00. 

(H2) Uniformly for 1 6*1 < 6{r) 

f{re'% ~ /(r)e^^"('')-^^''^(0 ^hen r tends to p. 

[That is, f{re'^) = /(r)e^''"(^)-3'''''M(l + 7(r, ^)) with |7(r,^)| < 7(r) 
when \9\ < 6{r) and linv->p7(?') = 0.] 

(H3) and uniformly for 6{r) < \9\ < vr 

f{re^^)\/b{r) = o{f{r)) when r tends to p. 



Hayman's theorem jTT[ Theorem VIII.4] states the following. 

Theorem 4.11 Let f{z) = e^^^^ be a H-admissible function with radius of 
convergence p and C, = C{n) be the unique solution in the interval ]0,p[ of the 
saddlepoint equation 

J'(0 

^7(0="- 



Then 



where b{z) = z^h"{z) + zh'{z). 
4.2.2 Proof of Proposition 14.91 

Let /C be the set of partial injections in which all the cycles have length 1 and let 
J' the set of partial injections without any cycles (a subset of /C). The elements 
of J7 are known as fragmented permutations, see [IH Section II. 4. 2]. 

Let Kn and J„ be the number of size n elements of /C and J, and let K{z) 
and J{z) be the corresponding EGS. We want to show that -j^ is equivalent to 

e 
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The series J{z) is studied in detail in [11^ Example VIII. 7, Proposition 
VIII. 4]. There, it is shown in particular that J{z) is H-admissible and that 

J(z) = exp f-^) and = ^e2v^n-i(l + o(l)). (1) 

A partial injection in /C consists of a set of length 1 cycles and a fragmented 
permutation. It follows that 



k\(n-k)l 



so that 



n=0 \n=0 / \n=0 / ^ ' 

Now is H-admissible: this can be verified directly, or by application of |1H 
Theorem VIII. 5]. We already noted that J(z) is H-admissible, and hence K{z) 
is H-admissible as well, as the product of two H-admissible functions ( [111 The- 
orem VIII. 5] again). 

The saddle-point equation ^j^^ = n (see Section [4. 2. ip is 

z(2 -2z + z2; 



n. 



(i-zy 

i.e. z^ - {n + 2)z'^ + 2{n + l)z-n = 0. 

Let Pniz) be the polynomial on the left hand side of this last equation. Ex- 
amining the sign of the derivative of Pn{z) on the interval [0, 1] and the values 
of Pn at and 1, we find that P„ has a unique zero between and 1, say Cn- 
Moreover 

This asymptotic development can be obtained using maple, based on the appli- 
cation of the Cardan method to this degree 3 polynomial. We can also observe 
the following. Let Qn{z) be the polynomial defined by the identity 

Pn{l -Z) = l-Z+{1- n)z'^ -Z^ = Qn{z) - Z^ . 

The zero of Qn{z) in the interval [0, 1] is 

V4n -3-1 1 1 _i. 
l[n — \) y n In 

and if = 1 — a„, we have Pn{f3n) = — c^n) which is negative for n large enough. 
Now let 7„ = 1 - ^ + ^. Then 

Pni7n) = ^ + Oin-l), 
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which is positive for n large enough. It follows that Pn < Cn < In, justifying 
the development in ([2]). 

With the notation of Section 14.2.11 we also have 

(f d 
b{z) = z^-ij^ log K{z) + z— log K{z) 

_ z{2 -2z + 3z^ - z^) 
SO that 6(Cn) = 2n^/^ + 0{n). 



Then we have 

logc = „iog (i - -L + i- + o = + o (-L 

and K(C„) = exp U, + ~ 
By Theorem 14. IH we now have 

Proposition 14.91 follows since [z"]I{z) ~ f^^^^^"^''^ (Proposition I3.9P and 
hence, ~ -f=. 

4.3 A remark on the Hanna Neumann conjecture 

The Hanna Neumann Conjecture (HNC), recently established by Mineyev [23] 
after several decades of partial results, deals with the rank of the intersection 
of finitely generated subgroups of free groups (see also [M] for an alternative 
proof, purely in terms of groups and graphs). For convenience, let the reduced 
rank of a subgroup H, written rk{H), be equal to 

rk{H) = max(0, rank(i7) - 1). 

Mineyev's theorem states that, if H and K are finitely generated subgroups of 
F, then rk{H n K) < rk{H)rk(K), as conjectured by Hanna Neumann. It also 
shows the stronger inequality conjectured by Burns |7j, formerly known as the 
strengthened Hanna Neumann conjecture (SHNC): 

rk{H n K^) < rk{Hyk{K), 

where the sum runs over all subsets KgH (in K\F/H) such that H n ^ 1 
and = g-^Kg. 

We discuss here how our results show that the cases where this inequality 
is non-trivial are rare, in the sense that rk{H n K) and ^ rk{H n K^) are 
generically equal to zero. 
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It was observed, initially by Stallings [32] and Gersten [13], that HNC and 
SHNC have natural interpretations in terms of Stallings graphs. If F is an A- 
labeled graph, let us denote by the difference between the number of edges 
and the number of vertices of F: thus rk(H) = x(F(//)). Let A{H,K) be the 
graph obtained from T(H) and T{K) as follows: the vertices of A(H,K) are 
the pairs {u,v) such that u is a vertex of T{H) and v is a vertex of T{K); and 
the edges of A{H,K) are the triples {{u,v),a, {u',v')) such that {u,a,u') is an 
edge of T{H) and {v, a, v') is an edge of T{K). 

Let Ai be the connected component of A{H,K) containing (1,1) (where 
1 denotes the origin of T(H) and of T{K)), and let A2 be the union of the 
connected components of A{H, K) which are not trees. Then HNC holds for 
H and K if and onlyjf x(Ai) < 7v{H)7v.{K), and SHNC holds for H and K if 
and only if x{^2) < rk{H)rk{K). 

Now observe (as in Proposition 13. 7p that a randomly chosen {k + £)-tuple of 
elements of Rn is composed of the juxtaposition of a randomly chosen A:-tuple 
and a randomly chosen £-tuple. Exponentially generically, such a /c-tuple h 
and ^-tuple h' generate subgroups with trivial intersection: in particular, HNC 
holds exponentially generically in the word-based distribution. 

In fact, with the same ideas as in the proof of Theorem 14.31 exponentially 
generically, there is no loop in F((/i)) with an occurrence as a loop in F((/i')). 
Therefore SHNC holds exponentially generically. 

As mentioned above, both HNC and SHNC are now known to hold, but 
it seems interesting to point out that exponentially generically, they hold for 
trivial reasons. 

5 An intermediate property 

In this section, we discuss an intermediate property of subgroups, that is a 
property such that the proportion of subgroups of size n with this property 
has a limit which is neither nor 1 (respectively the negligible and the generic 
cases). 

Theorem 5.1 The probability that a random size n subgroup of intersects 
trivially the conjugacy classes of the generators tends to e~^' when n tends to 
infinity. 

The discussion of this property is included here because we do not know 
many examples of such intermediate properties. Unfortunately, the property 
in question is geometric in the sense that it depends on the combinatorial pa- 
rameters of the Stallings graph of the subgroup, and is not preserved under 
the automorphisms of Fr. It would be interesting to exhibit such a property 
that would be algebraic (preserved under automorphisms). One might think 
for instance of the property of avoiding the conjugacy classes of all the elements 
of some basis of Fr, or the property of avoiding all primitive words. 

Remark 5.2 The property described in Theorem lS.ll is exponentially negligible 
in the word-based distribution. Indeed, if /i is a A;-tuple of reduced words of 
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length at most n, then T{{h)) has exponentially generically k loops of length at 
least I and no loop of length 1 (see the discussion in Section 13.11 with a = | 
and A = |). □ 



The rest of this section is devoted to the proof of Theorem 15. 1[ It is easily 
verified that a subgroup H contains a conjugate of letter a £ A if and only if a 
labels a loop at some vertex of T{H), that is, if and only if the corresponding 
partial injection has some fixpoint. Since the drawing of the partial injections 
corresponding to the different letters is independent, the theorem follows di- 
rectly from the following proposition. 

Proposition 5.3 The probability that a size n partial injection has no fixpoint 
tends to ^ when n tends to infinity. 

Remark 5.4 Note that ^ is also the limit of the probability that a size n 
permutation has no fixpoint (a so-called derangement, see [I^ ) . □ 

Our proof of Proposition 15.31 again uses Hayman's theorem (Section I4.2.ip . 
We also need the following technical result. 

Proposition 5.5 Let fo{z) be an H- admissible function with radius of conver- 
gence p < oo. Then f{z) = e^^ fo{z) is H-admissible as well. 

Proof. Since /o is analytic at the origin, it is clear that f{z) is analytic at the 
origin as well, with a radius of convergence equal to that of fo{z). 

Let h{z) be such that f{z) = e^^^\ If h{z) = ho{z) — z, then we have 

Let ao{t) = thQ{t), a{t) = th'{t), bo{t) = t^h'^it) + ao{t) and b{t) = t^h"{t) + 
a{t). Then a{t) = ao{t) — t and b(t) = &o(0 ~ 

It is immediate that limt-^pb{t) = +oo since this limit holds for bo. That 
is. Condition (Hi) holds. 

We now verify Condition (112). Let 5{t) be a positive function such that 
limt~^p6{t) = 0; and such that, uniformly for \6\ < S{t), and as t tends to p, 

ho{te'') = ho{t) + iOaoit) - \e%o{t) + o{l). 

Then 

/i(te^^) = /io(te*^) -te^^ 

= hoit) + i^ao(t) - \ehQ{t) + o(l) - te'^ 

= h{t) + iea{t) - ^9^b{t) + o(l) - te*^ + t + ti9- ^tO^. 

We now observe that, if |^| < 5{t) and as t tends to p, then \ti6\ < t5{t) = o(l) 
and similarly, ^tO'^ = o(l). Finally, 

|t(l - e'^)| = tJ {I- cos ey + sm^ 9 = ^2(1 -cos^) < t\e\ < td{t) = o(l). 
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Thus h{te'^) = h{t) + iea{t) - \eH{t) + o(l) uniformly for \e\ < which 
concludes the verification of (H2). 

f(te^^)^/b(t) 

Finally, we want to show that tends to when t tends to p, 

uniformly for 6{t) < \9\ < vr. We have 

J9\ /U7\ f (+„iS\„-te 



fit) /o(t)e-* 



t 



hit) V boity 

Since /o is H-admissible, uniformly for d{t) < \9\ < it and as t tends to p, 



foii) 

Moreover, ^1 — = 1 + o(l) since lim(_j.p 6o(t) = +oo. Finally, when < 

t < p, le^^i-^'')] = e^^i-™*^^) < e^P. This suffices to conclude that (H3) holds, 
and hence that f{z) is H-admissible. □ 

Proof of Proposition 15.31 Let C be the set of partial injections without 
fixpoints (i.e., without size 1 cycles), let L„ be the number of size n ele- 
ments of C and let L{z) be the corresponding EGS. We want to show that 

Ln = i/„(l + 0(l)). 

The EGS L{z) is computed using the standard calculus of enumeration of 
labeled structures (displayed in [Til Figure 11-18]), which was already used to 
compute I{z) in [3]: since the EGS of cycles is log(Y^), the EGS of cycles of 
size at least 2 is log(Y^) — z and the EGS of non-empty sequences is we 
have 

1 . z \ 1 ( z'^ 



Liz) = exp (^log(^) - - + r^lj = ^T^lJ = 'i^y-'- 

We already know that I{z) is H-admissible ^ Lemma 2.8] and Proposition 
shows that L{z) is H-admissible as well. 

The saddlepoint is the solution (n in the open interval ]0, 1[) of the equa- 
tion = n. An elementary computation shows that we need to solve the 
equation 

z^ + {n- l)z'^ - {2n + l)z + n = 0, 
i.e., (z + n + l)(l-z)2-l = 0. 

Letting z = and z = 1 in this equation shows that there is a solution in 
the interval (0, 1); moreover, one verifies easily that (z + n + 1)(1 — z)"^ — 1 is 
monotonous on (0, 1), and hence our equation has exactly one solution in that 
interval, say, Cn- From < < 1, we deduce that < (1 — Cn)^ < and 

hence 1 - < Cn < 1 - In particular, Cn = l-^ + (;^) . 
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It now follows from Theorem 14. 1 II that 

[z-]L{z) = -J§^{l + o{l)). 

CnV27r6(Cn) 

In view of the proof of Proposition 15.51 b{t) = bo{t) — t, where is the corre- 
sponding function for the H-admissible function I{z). Using [3l Equation (7), 

p. 392], we find that bo{t) = jj^, and hence b{t) = ^^^"^"a"^*'^ 
Elementary computations show that 

6(Cn) = 2ni f 1 + O ^ ^ 



and , ^ = (l + O ( 
Moreover, = exp ( —n log ( 1 -= + O 



n \ nWn 



exp(^+i + o(^ 



Finally, L(Cn) = V^exp - 2 + O (^)) ^ ^ (n)) ' 

3 

At last we have: [z"]L(z) = ^-^n-^^^ (^1 + O • 

Comparing with the estimate of in Proposition 13. 9^ we find the 

announced result, namely 

L„ [z^]L{z) 1 

□ 



6 Finitely presented groups 

One of the motivations for the study of subgroup distributions has been the 
investigation of the statistical properties of finitely presented groups, see \14:\ 
El El [281 El I]- Strictly speaking, this would require a notion of distribution 
of these groups, so that one would make a list of non-isomorphic groups and 
investigate the frequency of groups with certain properties within that list. No 
such notion is available, as far as the authors are aware and current literature 
operates rather with a notion of distribution of finite presentations. 

Recall that a finite presentation is a pair {A, R), where A is a finite set (the 
alphabet of generators) and i? is a tuple of elements of F{A) (the relators). 
The resulting finitely presented group G, written G = {A \ R), is the quotient 
G = F{A)/N{R), where N{R) is the normal subgroup generated by R. The 
usual approach of statistical properties of finitely presented groups is based on 
the uniform distribution on /c-tuples of reduced (or cyclically reduced) words of 
length at most n. 



21 



Of course, different presentations may yield the same group, even if the 
alphabet of generators is fixed. We are not aware of an analogue of Proposi- 
tion [SSI above, which would state, say, that the distribution of finitely presented 
groups with k relators of length at most n resulting from the uniform distribu- 
tion on /c-tuples of reduced (or cyclically reduced) words of length at most n, is 
uniform, at least on a generic subset of fc-tuples. However, partial results exist 
in this direction for one-relator groups (Kapovich, Schupp and Shpilrain |18j . 
Sapir and Spakulova [30l [31] ) . 

In this section, we want to discuss an idea that may seem reasonable in this 
context, but which turns out to be disappointing. If H is the subgroup generated 
by the tuple of relators R, then N{R) = N{H), so the group G = {A \ R) is 
also specified by the pair {A \ H). Thus, instead of looking at the normal 
closure of a finite set of elements, we look at the normal closure of a finitely 
generated subgroup. Now, clearly, if one generates a list of subgroups H by 
listing /c-tuples of generators (the word-based distribution discussed earlier in 
this article), then the distribution of groups produced by this process will be 
the same as if one were working with presentations. 

The idea we wish to explore is to generate the subgroup H via its Stallings 
graph, that is, to use the graph-based distribution of subgroups. Precisely, 
we may present groups via pairs, {A \ T) where A is an alphabet and F is a 
Stallings graph. This is a priori a more compact representation of the group 
(more compact in bit size, less convenient to ET'[i]X)(j More importantly, as we 
have seen that the graph-based distribution of subgroups is different from the 
word-based distribution, we may anticipate a different distribution of finitely 
presented groups as well, which would give us different insights on finitely pre- 
sented groups. 

Now an interesting feature of the statistical study of group presentations by 
tuples of relators is that the groups produced are generically non-trivial, and 
in fact infinite. More strongly, if A and k are fixed and if the maximal length n 
of the relators in the fc-tuple R tends to infinity, then generically G = {A \ R) 
is such that every subgroup generated by 1^41 — 1 elements is free [.2j. It is also 
known that G is generically hyperbolic (Ol'shanskii |^28j and Champetier [8l[9], 
proving a statement of Gromov [14J). 

In sharp contrast, and somewhat disappointingly, generically, a finitely pre- 
sented group of the form {A \ T) is trivial. 

Theorem 6.1 Generically, the finitely presented group {A \ T) is trivial. In 
other words, generically, the normal closure of a randomly chosen subgroup of 
Fr of size n, is itself. 

The rest of Section [6| is devoted to the proof of Theorem 16.11 We note that 
if the lengths of the cycles of the partial injection induced by letter a in T{H) 
are relatively prime, then a belongs to the normal subgroup N(H), and hence 
a = 1 in G = {A \ H). Thus it suffices to prove the following proposition. 

^It would be more interesting to have a unique, discrete representation of finitely generated 
normal subgroups, but no such representation seems to be known. And distinct normal 
subgroups may well lead to isomorphic quotients. 
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Proposition 6.2 Generically, the lengths of the cycles of a size n partial in- 
jection are relatively prime. 

Remark 6.3 Our proof that {A \ H) is generically trivial relies on a rather 
rough upper bound: we show that generically with probability 1 — 0{n~'s), 
each letter a is a product of conjugates of powers of a in ff. We do not know 
whether {A \ H) is exponentially generically trivial. See Remark 14.101 for a 
similar situation. □ 



6.1 The permutation case 

We start with the case of permutations, which is interesting in and of itself. 
Observe that if the lengths of the orbits of a permutation are not relatively 
prime, then these lengths have a common prime divisor p, which is in particular 
a divisor of n. Let Vn be the set of size n permutations in which all the orbits 
have size a multiple of p. 

Lemma 6.4 Let n > 2 and let p he a prime divisor of n. Then 

Proof. We fix p, so n is of the form n = mp and we proceed by induction on 

(p) 

m. If m = 1, that is, p = n, then \Vn \ is the number of size n cycles, namely 

(n — 1)!. We now assume that m > 1. 

(p) 

We enumerate the elements of Vn in terms of the size kp of the orbit of 1 : 
to determine such a permutation, one needs to select the other kp—1 elements of 

that orbit, select a cycle on these kp elements, and select a permutation on the 

(p) 

remaining elements, that is, an element of 'P^_f.p. Thus, using the convention 
that I = li we have 



k=l 
m 



^ in - 1)! (p) 



■m-l |-p(p) 

(n-l)!E 



Isolating the term j = and using the induction hypothesis, it follows that 

(m—l \ / m—l 

l + 2^(j»p-M =(n-l)! il + 2pp-^Y.J~''^ 

Since the map x xp is non-increasing on positive reals, we have 
{j + l)p < xp dx = p{{j + l)p — jp). 
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Therefore, isolating the term j = 1, 

m—l m—1 



m-2 



= l+^(i + l)i-^ 

< l+p{{m - l)p - 1) 

< 1 + p{mp — 1) = p pup—p+l. 



Now we have 



\Vi^^ < {n - 1)1 (l + 2pp ^ (p^ lnl-p+1 
<{n- 1)! (2np + 1 - 2pp + 2pp~^ 

Since p > 2, it holds 

1 i_i 1 / 1\ ill 

np — nP = DP [ \ > r)P — > — 

^ ^ pj ~ 2 - 2 

and hence 

\V^P^\ < (n - l)!2np = 2n!np~\ 
which concludes the proof. □ 

Proposition 6.5 The probability that the lengths of the orbits of a size n per- 
mutation are not relatively prime is at most equal to + 2n~3 logg n. 

Proof. Let Q„ be the set of size n permutations for which the lengths of the 
orbits are not relatively prime, and let = 

As we already observed, a common divisor of the lengths of the orbits of a 
size n permutation is also a divisor of n. Therefore, if n is prime, then Qn is the 
set of size n cycles, so \Qn\ = (n — 1)!, g„ = ^ and we have the desired result. 

(p) 

If n is not prime, then every size n permutation in Qn is in Vn for some 
prime divisor p of n. These sets are not pairwise disjoint, but the sum of their 
cardinalities is an upper bound for \Qn\- For these values of p, \Vn^\ < 2n\np ^ 
by Lemma |6.4[ Separating the case p = 2 from the cases p > 3, we find that 
Qn ^ + 2Z)n~3, where D is the number of distinct odd prime divisors of n. 

Since n > 3^, we have D < logg n and hence 

< -7= + 2Dn"§ < + 2n~3 logon, 
\/n Jn 



which concludes the proof. □ 
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6.2 Proof of Proposition 16.21 

Isolating the cycles in a size n partial injection, reveals a permutation (on a sub- 
set X of [n]) and a fragmented permutation (i.e., a cycle-less partial injection) 
on the complement of X. 

The EGS J{z) = ^-2" of fragmented permutations was discussed in 

Section 1121 where we noted in particular that J{z) = exp (^j^^. Let us add 
the following observation. 

Lemma 6.6 The sequence {Jn/n\)n>Q is increasing. 

Proof. Let M„ = ^, so that J{z) = En>o^n-^"- The equalities -^J^z) = 
{i-z)'^ J(-z), and hence (1 — z)'^-^J{z) = J{z), yield the following recurrence 
relation, for all n > 2: 

(n + l)M„+i = (2n + 1)M„ - (n - l)M„_i. 

It follows that, for all n > 2, 

(n + l)(M„+i - M„) = nMn - (n - l)M„_i = n(M„ - M„_i) + M„_i. 

The result follows by induction since Mi = 1 and M2 = | (see for instance 
Section II.4.2]). □ 

Specifying a size n partial injection whose permutation part (the union of 
the cycles) has size k, amounts to choosing k elements, choosing a permutation 
on these k elements, and choosing a fragmented permutation on the remaining 
n — k elements: the number of such partial injections is 

k\Jn-k = n\- "^"^'^ 



kj " (n — A;)! ' 

and the number of those in which the sizes of the cycles have a non-trivial gcd 
is at most equal to 

2n! '^"-^ (± + }^^ 



\n-ky.\^ kl 

by Proposition 16.51 Moreover, summing the numbers of partial injections with 
permutation part of size k, we get 

fe=0 ^ ' 

We use these observations to show the following facts, which together suffice 
to establish Proposition 16.21 



Fact 6.7 The proportion of size n partial injections whose permutation part 
has size less than na is 0{n~'^). □ 
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Fact 6.8 The proportion of size n partial injections whose permutation part 
has size greater than ns and for which the sizes of the cycles has a non-trivial 
gcd, is 0{n~6). □ 



Proof of Fact 16.71 The proportion of size n partial injections whose permu- 
tation part has size less than is 



< (n3 -1-1)-— by Lemma 16.61 



The last inequality holds since = ©(n^a) (compare the asymptotic equiva- 
lents of ^ given in Section 14.2.21 and of ^ in Proposition 13. 9p . □ 

Proof of Fact 16.81 Here we use Proposition 16.51 and the fact that, for large 
enough integers, we have —7= H < —tt- The number of size n partial 

Vk Vk 

injections whose permutation part has size greater than n3 and for which the 
sizes of the cycles has a non-trivial gcd, is bounded above by 



1 Jn- 

n\- 



{n-k)\ 

k=\n^i^ k=\n^ 

n 



k=0 



Jn—k . — i 

[n — k)\ 



Thus the proportion of these partial injections is at most 4n e . □ 
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